On distinguishability, orthogonality, and violations of the second law: 
contradictory assumptions, contrasting pieces of knowledge 
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Two statements by von Neumann and a thought-experiment by Peres prompts a discussion on the notions of 
one-shot distinguishability, orthogonality, semi-permeable diaphragm, and their thermodynamic implications. 
In the first part of the paper, these concepts are defined and discussed, and it is explained that one-shot dis- 
tinguishability and orthogonality are contradictory assumptions, from which one cannot rigorously draw any 
conclusion, concerning e.g. violations of the second law of thermodynamics. In the second part, we analyse 
what happens when these contradictory assumptions comes, instead, from two different observers, having dif- 
ferent pieces of knowledge about a given physical situation, and using incompatible density matrices to describe 
it. 

PACS numbers: 03.65.Ca,65.40.Gr,03.67.-a 



INTRODUCTION: VON NEUMANN (AND PERES) ON 
ORTHOGONALITY AND THE SECOND LAW 



In § V.2 of von Neumann's Mathematische Grundlagen der 
Quantenmechanik 1 1] (see also |2]) we find the two following 
propositions: 1 

"[T]wo states </>, if/ [. . . ] can certainly be separ- 
ated by a semi-permeable diaphragm if they are (1) 
orthogonal"; 

and the converse 

"[I]f <f>, <A are not orthogonal, then the assumption 

of such a semi-permeable diaphragm contradicts (2) 

the second law [of thermodynamics]". 

These statements concern thermodynamics and the notions 
of distinguishability and orthogonality. 

Von Neumann proved the first statement above in an ana- 
lysis involving "thermodynamic considerations" (the same 
considerations by which he derived his entropy formula). 
However, he did not actually prove the second statement, but 
rather the converse of the first one, viz.: if two "states" can be 
separated by semi-permeable diaphragms, then they must be 
orthogonal. 

It was Peres |3] that gave a seemingly direct demonstration 
of the second statement (|2j by means of a thought-experiment 
with 'quantum gases' in which he explicitly assumes the non- 
orthogonality of two quantum "states", their separability by 
semi-permeable diaphragms, and from this assumptions cre- 
ates a thermodynamic cycle which violates the second law. 



'Electronic address: mana@imit.kth.se 

1 "[Z]wei Zustande (p, ift [. . . ] durch semipermeable Wande bestimmt ge- 
trennt werden konnen, wenn sie orthogonal sind", and "[S]ind </>, i/f nicht 
orthogonal, so widerspricht die Annahme einer solchen semipermeablen 
Wand dem zweiten Hauptsatz". 



More precisely, the thought-experiment involves two observ- 
ers: one making the non-orthogonality assumption, the other 
performing the separation. 

The interplay of orthogonality, distinguishability, thermo- 
dynamics, and multiplicity of observers is quite interesting; 
therefore we want to discuss and analyse it in varying depth, 
mainly with paedagogical purposes. The paper is divided into 
two main parts, reflecting two main perspectives. 

In the first part we offer a discussion on the three concepts 
of preparation and measurement procedure and "one- shot" 
distinguishability, and of their mathematical representatives: 
density matrix, positive-operator-valued measure, and ortho- 
gonality. First, we make clear that 'orthogonality' is simply 
the mathematical counterpart of 'one-shot distinguishability'; 
thus, one cannot assert or deny the one without asserting or 
denying the other as well. Second, we show how the idea of 
a semi-permeable membrane which can separate two prepara- 
tions ("states") is just a particular realisation of a measurement 
procedure which can distinguish, in one shot, those prepara- 
tions. This will provide an occasion to discuss the relation of 
these concepts to thermodynamics. The principal conclusion 
of the first part will be that the second statement Q — and 
its demonstration by Peres — contains a contradiction in its 
premise; i.e., it has the logical form '(A A -A) => B' . The 
contradiction in the premise is the assumption of distinguish- 
ability and non-distinguishability at the same time. 

The contradiction in Peres' experiment, however, can also 
be conceived as a situation in which two scientists use two 
different density matrices to analyse the same physical phe- 
nomenon. This provides an illustration of the fact that a dens- 
ity matrix adopted to describe a preparation is always depend- 
ent on an observer's particular knowle dge a bout that prepar- 

ation JlBBElllllElEilEl El El El El El El, 

as well as on the sets of preparations and measurements con- 
sidered by the observer H El El El Ell • It also illustrates 
what can happen when the two observers' have and use dif- 
ferent pieces of knowledge, hence different density-matrix as- 
signments. For example, the second law may seemingly be 
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violated for one of the observers. The fact that a mathem- 
atical description always depends on one's particular know- 
ledge of a phenomenon, however, is not only true in quantum 
mechanics, but in classical physics as well. A particular case 
within thermodynamics was shown by Jaynes' 1 20] by means 
of a thought-experiment which is very similar to Peres' and in 
which the same seeming violation of the second law appears 
according to one of the observers. For this reason, we shall 
juxtapose Peres' demonstration to Jaynes', hoping that they 
will provide insight into each other. 

We assume that our reader has a working knowledge of 
quantum mechanics and thermodynamics (we also provide 
some footnotes on recent developments of the latter, today 
better called 'thermomechanics' [21, since they are 

apparently largely unknown to the quantum-physics com- 
munity.) We also want to emphasise that this paper is not 
directly concerned with questions about the relation between 
thermodynamics and statistical mechanics, nor to questions 
about "classical" or "quantum" entropy formulae. Since our 
discussions will regard the second law of thermodynamics, the 
reader probably expects that entropy will enter the scene; but 
his or her expectations will not be fulfilled. Peres', von Neu- 
mann's, and Jaynes' demonstrations are based on cyclic pro- 
cesses, which start from and end in a situation described by 
the same thermodynamic state. 2 Assuming the entropy to be 
a state variable, its change is naught in such processes, inde- 
pendently of its mathematical expression. The second law of 
thermodynamics assumes then the form 3 



Q < (cyclic processes), 



(3) 



where Q is the total amount of heat absorbed by the thermo- 
dynamic body in the process. This is the entropy-free form 
we are going to use in this paper. 



1. PART I: CONTRADICTORY PREMISES AND 
VIOLATIONS OF THE SECOND LAW 

Alice felt even more indignant at this suggestion. 
"I mean," she said, "that one can't help growing 
older." 1 351 



1.1. Preparations and density matrices, measurements and 
positive-operator-valued measures 

We begin by informally recalling the definitions of 'prepar- 
ation procedure' and 'density matrix', 'measurement proced- 
ure' and 'positive-operator-valued measure', and 'one-shot 



distinguishability' and 'orthogonality' . In our definitions we 
follow Ekstein QlH, Giles ^J3<J|4(|, Foulis and Ran- 
dall GHEES, Band and Park 00, and Peres USUI; 
the reader is referred to these references for a deeper discus- 
sion (see also 0, B d G3 GIH E3 EH El and the 
introductory remarks in Komar 15 311 ). 

In quantum theory, a preparation procedure (or 'prepara- 
tion' for short) is "an experimental procedure that is com- 
pletely specified, like a recipe in a good cookbook" (H, p. 
12, cf. also p. 424). It is usually accompanied by a meas- 
urement procedure, which can result in different outcomes, 
appropriately labeled. The probabilities that we assign to the 
obtainment of these outcomes, for all measurement and pre- 
paration procedures, are set as postulates in the theory and 
encoded in its mathematical objects, described below. Prepar- 
ation and measurement procedures are not mathematical ob- 
jects. As an oversimplified example, the instructions for the 
set-up and triggering of a low-intensity laser constitute a pre- 
paration procedure, and the instructions for the installation of 
a detector constitute a measurement procedure. 

A preparation procedure is mathematically represented by a 
density matrix 4 g, whose mathematical properties we assume 
well known to the reader |3j, |52j, 15511 . A measurement pro- 
cedure is instead mathematically represented by a positive- 
operator-valued measure (POVM) viz., a set of pos- 
itive (semi-definite) matrices, each associated to a particu- 
lar measurement outcome, which sum to the identity mat- 
rix I3l l52ll56ll57ll58ll . These two mathematical objects encode 
our knowledge of the statistical properties of their respective 
preparation and measurement: when we perform an instance 
of the measurement represented by {£;} on an instance of the 
preparation represented by g, the probability p t assigned to 
the obtainment of outcome i is given by the trace formula 
pi = trdpE,). 

If we wish to specify not only the probabilities of the vari- 
ous outcomes of a measurement, but also its effect on the pre- 
paration, we associate to the measurement procedure a com- 
pletely positive map (CPM) |E! E3 E3 El El We shall 
not use the full formalism of completely positive maps here, 
however, but only the special case where the CPM is a set of 
projectors {II j] and the effect on the density matrix g when 
result i is obtained is given by 



Qi = 



Iliglli 
tr(/7,£/7,)' 



(4) 



1.2. One-shot distinguishability and orthogonality 



2 The concept of 'state' in thermomechanics may include not only the in- 
stantaneous values of several (field) variables, but even their histories, 
or suitably defined equivalence classes thereof. This concept has an in- 
teresting historical development. Cf. e.g. Truesdell et al. 1 24, Preface], 
No Upl llSl. Willems (2J, Coleman et al. 13 EjDel Piero et 
al. QlEl~§ilhavy Q. 

3 See Serrin E3ll for a keen analysis of the second law for cyclic processes. 



Suppose a physicist has realised one instance of a prepar- 
ation procedure, choosing between two possible preparation 



4 The original and more apt term was 'statistical matrix', which has unfortu- 
nately been changed to 'density matrix' apparently since Wigner l54l : cf. 
Fano 01. 
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procedures. We do not know which of the two, but we can 
perform a measurement on the instance and record the out- 
come. Suppose there exists a measurement procedure such 
that some (at least one) of its outcomes have vanishing prob- 
abilities for the first preparation and non-vanishing probabilit- 
ies for the second, while the remaining (at least one) outcomes 
have vanishing probabilities for the second preparation and 
non-vanishing probabilities for the first. This means that by 
performing a single instance of this measurement procedure 
we can deduce with certainty which preparation was made, by 
looking at the outcome. In this case the two preparations are 
said to be one-shot distinguishable (cf. 1 19]). The fact that 
some sets of preparation procedures are one-shot distinguish- 
able while others are not is the basic reason why quantum 
mechanics is a probabilistic theory. 

If two preparation procedures are one-shot distinguishable, 
it is then easy to separate them. By this we mean that if we 
are presented with many instances of these preparations, we 
can for each instance make a measurement and tell the prepar- 
ation, and thus separate the instances of the first preparation 
from the instances of the second preparation. But the converse 
is also true: if we can separate with certainty the two groups 
of instances, it means that we can distinguish with certainty 
the preparation of each instance. 

Let us see how the one-shot-distinguishability prop- 
erty is represented mathematically. The two one-shot- 
distinguishable preparation procedures of the above example 
are represented by density matrices and if/, and the measure- 
ment by a POVM {A^}. According to the definition of one- 
shot distinguishability given above, we must be able to write 
this POVM as \A k ) = {E t , Fj}, with 

tr(0E ; ) = and tr(i^) + for all E h 
trtyFj) = and tr(0F ; ) * for all Fj. ^ 

This is the mathematical form of our definition of one-shot 
distinguishability. 

An important consequence of the equations above is the fol- 
lowing: the density matrices and ip must be orthogonal, viz. 
tr(<f»f/) = 0. 6 We prove this simple theorem in the Appendix. 

Thus, the fact that two preparation procedures are one-shot 



Note the difference between a preparation (or measurement) procedure, 
and the realisation of an instance thereof. The first is a set of instructions, 
a description, the latter is a single actual realisation of those instructions. 
This terminology may help avoiding the confusion which some authors 
(see e.g. 16(1 16 Illici t still make between 'ensemble' and 'assembly', as 
defined and conceptually distinguished by e.g. Peres Q pp. 25, 59, 292] 
(see also Ballentine 1 48, p. 361]). The term 'ensemble' is used with so 
different meanings in the literature (see e.g. Hughston et al. l63ll for yet an- 
other, though self-consistent, usage), that we prefer to avoid it altogether. 
Moreover, Bayesian probability theory (see e.g. Il(jl64ll ) makes the con- 
ceptual usefulness of this term questionable or at least obsolete. 
6 This expression is the usual definition of orthogonality, i.e. vanishing scalar 
product, between vectors: in our case the density matrices are considered 
as vectors in a real vector space of Hermitean matrices, with tr(<f>ifr) as the 
scalar product. 



distinguishable is mathematically represented by the ortho- 
gonality of their associated density matrices. Or, to put it an- 
other way, orthogonality is the mathematical representation of 
one-shot distinguishability. The converse of the mathematical 
theorem is also true: if two density matrices are orthogonal, 
then there exists a POVM with the properties Q above. But 
to conclude that the represented preparation procedures are 
one-shot distinguishable, we need first to assume that there 
physically exists a measurement procedure corresponding to 
that POVM. 7 . 

To prove this theorem (and its converse), thermodynamic 
arguments are not needed. It just follows mathematically 
from Eqns. which represent a probabilistic property of a 
measurement. So we begin to see that von Neumann's state- 
ment 0, in which he seems to derive orthogonality from 
separability by semi-permeable diaphragms, would not really 
need thermodynamic considerations. But we can make this 
more precise only after we have discussed the notion of a 
semi-permeable diaphragm, which appears in statement 0. 
This will be done in a moment; before then, we want to 
make some final remarks about distinguishability and about 
linguistic details. 

It may be the case that two preparation procedures cannot 
be distinguished by means of a single measurement instance, 
but can still be distinguished with arbitrary precision by study- 
ing their statistical properties, i.e. by analysing the results of 
an adequately large number of diverse measurements on sep- 
arate instances of these preparations. 8 They are thus distin- 
guishable, but not in one-shot. They are represented in this 
case by different, but non-orthogonal, density matrices. On 
the other hand, two preparation procedures are represented 
by the same density matrix when they have exactly the same 
statistical properties (with respect to a given set of measure- 
ment procedures) and so cannot be distinguished by a stat- 
istical analysis of measurement results, no matter how many 
instances of them we prepare and how many measurements 
we make. 

The whole discussion above is easily generalised to more 
than two preparation procedures. 



1.3. Interlude: linguistic details 

Preparation procedures are often called 'states'; however, 
the word 'state' is equally often used to mean the density mat- 
rix representing a preparation procedure. This double mean- 
ing is quite natural, because the concepts of preparation pro- 
cedure and density matrix are, as we have seen, strictly re- 
lated. They are nevertheless distinct, as shown e.g. by the fact 
that different preparation procedures (concerning the same 
physical phenomenon) can sometimes be represented by the 



7 This is not a light assumption; cf. e.g. Peres @], pp. 50 and 424. 

8 See e.g. H3|6i|63m|6i|2iJIB& cf - 3180 G10- 
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same density matrix. We wish to keep this distinction in this 
paper, and we shall thus stick to the two distinct terms, avoid- 
ing the term 'state'. 

The terms '(one-shot) distinguishable' and 'orthogonal' are 
also often interchanged in an analogous way; e.g., one says 
that "two preparations are orthogonal" (instead of "two pre- 
parations are distinguishable"), or that "two density matrices 
are distinguishable" (instead of "two density matrices are 
orthogonal"). Such metonymic expressions are of course 
handy and acceptable, but we must not forget that 'orthogonal 
preparations' only means 'one-shot distinguishable prepara- 
tions', so that if we say "these preparations are one-shot dis- 
tinguishable and non-orthogonal" we are then contradicting 
ourselves — not an experimental or physical contradiction, 
but a linguistic, or logical, one. Just as it would be contra- 
dictory to say that a classical force field is conservative and 
its integral along a closed path does not vanish; or that, in a 
given reference frame, a point-like body is in motion and its 
position-vector is constant; or that a non-relativistic system is 
closed and its total mass is changing. 

These remarks are pedantic, and many a reader will con- 
sider them only linguistic nit-picking; but these readers are 
then invited to take again a look at von Neumann's second 
statement (|2j. Is everything in order there? We shall come 
back to this point later. 



1.4. "Quantum" ideal gases and semi-permeable diaphragms 

In order to analyse a statement which involves, besides 
quantum concepts, also thermodynamic arguments and semi- 
permeable diaphragms, it is necessary to introduce a ther- 
modynamic body possessing "quantum" characteristics, i.e., 
quantum degrees of freedom; an 'ideal' body will do very 
well. For this purpose, we shall first recall the basic rela- 
tionships between work and heat for classical ideal gases. 
We shall then follow von Neumann (JJ] and introduce the 
quantum degrees of freedom as "internal" degrees of free- 
dom of the particles constituting the gases, and shall finally 
discuss how the interaction between the quantum and thermo- 
dynamic parameters is achieved by means of semi-permeable 
diaphragms. 

Let us first recall that (classical) ideal gases are defined 
as homogeneous, uniform thermodynamic bodies character- 
isable by two thermodynamic variables: the volume V > 
and the temperature T > 0, and for which the internal energy 
is a function of the variable T alone; 9 this implies, via the 
first law of thermodynamics, that in any isothermal process 
the heat absorbed by the gas, Q, is always equal to the work 



9 See e.g. the excellent little book by Traesdell and Bharatha l73ll . or 
Planck Fl §§ 86-91, 232-236], Lewis and Randall EE chap. VI], Part- 
ington EE §§ 11.54-57, IV.14, VIIA.21], Callen El §§ 3-4, 13-1-2, 16- 
10], Buchdahl E3 §§ 71, 82]; also Samohyl E3 §§ 5, 6]. 



done by the gas, W: 

Q = W = NkT ln(V f / Vj) (isothermal processes), (6) 

where V; and Vf are the volumes at the beginning and end of 
the process, k is Boltzmann's constant, and N is the (constant) 
number of particles. This formula will be true throughout the 
paper, as we shall only consider isothermal processes. (Note 
that W and Q can assume both positive and negative values.) 

One often considers several samples of such ideal gases in 
a container and inserts, moves, or removes impermeable or 
semi-permeable diaphragms 10 at any position one pleases. 11 
What the presence of these diaphragms, both in practice and 
in theory, really signifies in the case of ideal, non-interacting 
(e.g. non chemically reacting) gas samples, is that we can 
control and monitor the variables of the ideal-gas samples, 
(Vi, 7*i ; V2, T%\ • • •) independently of each other, even when 
some of the samples occupy identical regions of space sim- 
ultaneously. Thus, the problem becomes equivalent to one 
where all gas samples always occupy distinct regions of space, 
even though they may be in mechanical or thermal contact. 12 
For "quantum" ideal gases, as will be seen in a moment, the 
situation is not so simple. 

We must first face the question of how to introduce and 
mathematically represent quantum degrees of freedom in an 
ideal gas. Von Neumann |1, § V.2] used a hybrid classical- 
quantum description, microscopically modelling a quantum 
ideal gas as a large number N of classical particles possess- 
ing an "internal" quantum degree of freedom represented by 
a density matrix g living in an appropriate density-matrix 
space. This space and the density matrix are always assumed 
to be the same for all the gas particles. 13 He then treated two 
gas samples described by different density matrices as gases 
of somehow different chemical species. This idea had been 
presented by Einstein 1 9 1 ] eighteen years earlier, but it is im- 
portant to point out that the "internal quantum degree of free- 
dom" was for Einstein just a "resonator" capable of assum- 
ing only discrete energies. This degree of freedom had for 
him discrete (the original meaning of the adjective 'quantum') 
but otherwise statistically classical properties; it was not de- 
scribed by density matrices, and it did not provide non-ortho- 
gonality issues. Einstein's idea was hence less open to prob- 
lems than von Neumann's. 



10 Partington 17 61 S 28] informs us that these were first used in thermodynam- 
ics by Gibbs l79ll . 

1 1 In the limit, this leads to a formalism involving field quantities (cf. Buch- 
dahl l78t §§ 46, 75], and see e^. Traesdell |21[ lectures 5, 6 and related 
appendices] and [22, 80, 81, 82, 83]), as indicated by the possibility of 
introducing as many diaphragms as we wish and hence to control smaller 
and smaller portions of the gas samples. 

12 In particular, the samples have separate entropies and must separately sat- 
isfy the second law [79]. When the gases do interact, we enter into the 
more complex, and still under development, thermomechanic theory of 
mixtures (see e.g. I27[]pl8ll8ll8ll85ll86ll87ll83l: cf. also OH). 

13 Dieks and van Dijk |90] point rightly out that one should then more cor- 
rectly consider the total density matrix | £>, where n = NN\ (with 
Avogadro's constant) is their total number. 
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In fact, von Neumann's conceptual device presents some 
problems. For example, the chemical species of a gas is not 
a thermodynamic variable, and even less a continuous one: 
chemical differences cannot change continuously to zero. 14 It 
would then seem more appropriate to describe a quantum ideal 
gas by the variables (V, T,g) instead, taking values on appro- 
priate sets. That this would indeed be the only correct treat- 
ment can be seen from the fact that the thought experiments 
considered by von Neumann and reproduced below always 
involve some step in which the density matrix g of a gas is 
changed. This implies that g is something which we can and 
need to control, and as such it should be included in the list of 
variables which define our thermodynamic system l93ll . 

However, in the following discussion we shall follow 
von Neumann and Peres instead and speak of a '0-gas', or a 
'i/r-gas', etc., where or ip are the density matrices describing 
the internal quantum degrees of freedom of the gas particles, 
just as if we were speaking of gases of different chemical spe- 
cies (like e.g. 'argon' and 'helium'). The thermodynamic vari- 
ables are (V, T) for each such gas. 

In this framework, semi-permeable diaphragms have dif- 
ferent meaning and function than they have in the classical 
framework. This becomes clear when we analyse how they 
are modelled microscopically. The microscopic picture 1 1 , 
p. 196] 1 3, p. 271] is, paraphrasing von Neumann, to construct 
many "windows" in the diaphragm, each of which is made 
as follows. Each particle of the gases is detained there and a 
measurement is performed on its quantum degrees of freedom; 
depending on the measurement result, the particle penetrates 
the window or is reflected, and its density matrix is changed 
in an appropriate way. In other words, the diaphragm also 
acts on the translational degrees of freedom, separating the 
particles having different preparations spatially, with an effi- 
ciency which depends on the preparations and the implemen- 
ted measurement. This implies that the number of particles 
and hence the pressures or volumes of the gases on the two 
sides of the semi -permeable diaphragm will vary, and may set 
the diaphragm in motion, producing work (e.g. by lifting a 
weight which loads the diaphragm). There arises thus a kind 
of mutual dependence between the quantum degrees of free- 
dom and the thermodynamic parameters (like the volume V) 
of the quantum ideal gases. We see that for quantum gases 
the semi-permeable diaphragms not only make the existence 
of separate thermodynamic variables for different gas samples 
possible, as it happened for (non-interacting) classical gases, 
but also perform transformations of the (not thermodynamic - 
ally reckoned) quantum variable g. 

Such a diaphragm is then simply a device which implement 
a measurement procedure, and is mathematically described by 
a given POVM and a CPM. 

Let us illustrate how semi-permeable diaphragms work 



with two examples. 

First example: one-shot distinguishable preparations 

Imagine a container having volume V and containing a mix- 
ture of N/2 particles of a z + -gas and N/2 of a z~-gas; i.e., the 
particles have quantum degrees of freedom represented by the 
density matrices 

Z +«| Z + ><Z +|£:(10), ( 7) 

z-^\z-)(z-\ = (° °), (8) 

in the usual spin- 1/2 notation (Fig. ^ a )- These two dens- 
ity matrices represent quantum preparations that can be dis- 
tinguished in one shot by an appropriate measurement, im- 
plemented by two semi-permeable diaphragms as described 
above. The first is completely opaque to the particles of the 
Z~-gas and completely transparent to those of the z + -gas; the 
other is completely opaque to the particles of the z + -gas and 
completely transparent to those of the z~-gas. Mathematic- 
ally they are represented by the two-element POVM {z ± } and 
the CPMs (projections) {g i-> z ± gz ± / tr(z ± gz ± )}, as follows. 15 
For the first diaphragm: 

Z + i-> z + , let through, with probability tr(z + z + ) = 1, (9a) 



z~ i-» f, reflected, with probability tr(z~z~) = 1 (9b) 

Z + h-> z~, with probability tr(z~z + ) = 0, (9c) 

Z~ h-> z + , with probability tr(z + Z~) = 0, (9d) 

and for the second: 

z + h-> z + , reflected, with probability tr(z + z + ) = 1, (10a) 

z~ i-> z~ , let through, with probability tr(z~z~) = 1 (10b) 

Z + i-> z~, with probability tr( z~z + ) = 0, (10c) 

Z~ 1 * z + , with probability tr(z + z~) = 0. (lOd) 



Now imagine we insert these two diaphragms in the con- 
tainer (Fig. ^ a X ver y near to its top and bottom walls re- 
spectively. We push them isothermally toward the middle of 



(a) (b) 



0.5 |z + Xc + |+ 




lz + )fe + l 


Q < 


0.5 |z->< z -| 













Figure 1 : Separation of one-shot-distinguishable quantum 
gases. 



An observation made by Partington |76, § 11.28] in a reference to Lar- 
mor IU p. 275]. 



Due to the large number of gas particles considered, the outcome prob- 
abilities are numerically equal, within small fluctuations negligible in the 
present work, to the average fraction of gas correspondingly transmitted or 
reflected by the diaphragms. 
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the container until they come in contact with each other, so 
that the container is divided in two chambers. By doing so 
we have separated the two gases, with the z + -gas in the upper 
chamber and the z~-gas in the lower one (Fig.^ b). In order 
to move these diaphragms and achieve this separation we have 
spent an amount of work equal to 

N V/2 
-2 x —kT In -y- * 0.693 NkT, (11) 

because each diaphragm had to overcome the pressure exer- 
ted by the gas to which it is opaque. Since the process is 
isothermal, the quantity above is also the (positive) amount of 
heat released by the gases. 

The semi-permeable diaphragms can also be used to realise 
the inverse process, i.e. the mixing of two initially separated 
Z + - and z~-gases. In this case the expression il It would be the 
amount of heat absorbed by the gases as well as the amount 
of work performed by them. 



Second example: non-one-shot distinguishable preparations 

The quantum degrees of freedom of two gases may also be 
prepared in such a way that no measurement procedure can 
distinguish between them in one shot, and so there are no 
semi-permeable diaphragms which can separate them com- 
pletely. This has of course consequences for the amount of 
work that can be gained by using the diaphragms. Imagine 
again the initial situation above, but this time with a mixture 
of N/2 particles of a z + -gas and N/2 of an jc + -gas, with 

x + ^\x + )(x + \ = \{\\). (12) 

The two density matrices z + and x + are non-orthogonal, 
tr(z + x + ) + 0, and this must represent the fact that the prepara- 
tion procedures they represent cannot be distinguished in one 
shot. This also means that there are no semi-permeable dia- 
phragms which are completely opaque to the particles of the 
one gas and completely transparent to those of the other gas 
and vice versa. Mathematically this is reflected in the non- 
existence of a POVM with the properties (|5j. The absence of 
such completely separating diaphragms implies that we can- 
not control their mixing in a reversible way. 

It can be shown (U |5| that in this case the separating 
process requiring the minimum amount of work, and so the 
maximum (negative) amount of heat absorbed by the gas, 16 
can be performed by two diaphragms represented by the 
two-element POVM {q ,± } and the CPMs (projections) q h-> 



Readers interested in entropy questions should note that such process is 
irreversible and that there is no reversible process with the same initial and 
final thermodynamic states, as discussed by Dieks and van Dijk 90]. 



(a) (b) 



0.5 \z*)(z*\+ 
0.5 \x*){x*\ 


Q < 


\a+)(a + \ 







Figure 2: Separation of non-one-shot-distinguishable 
quantum gases. 

a^fjcrV tr(ar ± £>ar ± ), with 

±«, ±w ±, - 1 i 2± V2 +V2 \ 
a = \a )(a = - r- r-\ , (13) 

4 \ ± V2 2 + V2/ V 

1^)^(2+ V2)~V>± = 

i , , d4) 

i[±(2± V2)V> + (2+ V2)V>], 

i.e., the Hilbert-space vectors la*) are the eigenvectors of the 
matrix A given by 

A =2 Z + 2 X = 4(l lj' (15) 

and they are orthogonal: tr( a + a~) = 0. The action of the first 
diaphragm is thus 

z + t-> a + , let through, with probability tr(ff + z + ) ~ 0.854, 

(16a) 

z + i-» a~, reflected, with probability tr(aTz + ) ~ 0.146, 

(16b) 

x + i — > ar + , let through, with probability tr(a + x + ) ~ 0.854, 

(16c) 

x + h-> a~, reflected, with probability tr(aT;c + ) as 0.146, 

(16d) 

while for the second diaphragm: 

z + h-> ar + , reflected, with probability tr(o' + z + ) * 0.854, 

(17a) 

z + t-» a~, let through, with probability tr(a' _ z + ) ~ 0.146, 

(17b) 

x + t-» a + , reflected, with probability tr(a + x + ) ~ 0.854, 

(17c) 

x + i — > a ~, let through, with probability tr(ff _ a: + ) m 0.146. 

(17d) 

By inserting these diaphragms in the container as in the 
preceding example, they will transform and separate into two 
chambers our z + - and * + -gases, leaving in the end an a + -gas 
in the upper chamber and an aT-gas in the lower one (Fig.|2ji. 
The gases will have the same pressure as the original ones 
but occupy unequal volumes, because the ratio for the z + - and 
* + -gases to be transformed into an a^-gas and an aT-gas was 
approximately 0.854/0.146, as seen from Eqns. (I16> : this will 



7 



also be the ratio of the final volumes. The total amount of 
work we spent for this separation is 

0.146 V 0.854 V 

- 0. 146 NkT In — 0.854 NkT In — - — * 

0Al6NkT (18) 

(the first term is for the upper diaphragm, opaque to the a - 
gas, and the second for the lower diaphragm, opaque to the 
£c + -gas). This is also the amount of heat released by the gases. 

The significance of the equations above is that the half/half 
mixture of z + - and * + -gases can be treated as a 0.854/0.146 
mixture of an ce + -gas and an aT-gas. In fact, the two semi- 
permeable diaphragms could be used to completely separate 
an ar + -gas from an aT-gas, whose preparations are one-shot 
distinguishable. 

We shall see an application of the coupling between the 
thermomechanic and quantum degrees of freedom provided 
by semi-permeable diaphragms in the second part of the pa- 
per. For the moment, we want to stress the following two-way 
connexion: the existence of a measurement which can distin- 
guish two preparations in one shot implies the possibility of 
constructing a semi-permeable diaphragm which can distin- 
guish and separate the two preparations. Vice versa, if we had 
such a diaphragm we could use it as a measurement device to 
distinguish in one shot the two preparations, and so it would 
imply their one-shot distinguishability. 

The notion of a semi-permeable diaphragm implies of 
course something more, viz., the possibility of a coup- 
ling between the quantum degrees of freedom to be distin- 
guished and the translational degrees of freedom. It is this 
coupling, done with a hybrid quantum-classical framework 
(von Neumann 01 and the present paper) or completely within 
quantum mechanics (Peres |3]), that allows applications and 
consequences of thermodynamic character. However, the 
two-way connexion pointed out above is independent of the 
fact that the diaphragm may also have applications or con- 
sequences of thermodynamic character. 

1.5. Distinguishability, orthogonality, and the second law 

We can now summarise the observations and results that we 
have gathered up to now: 

The existence of a measurement which can distinguish two 
preparation procedures in one shot is equivalent to the ex- 
istence of semi-permeable diaphragms which can separate 
the two preparations, and is also equivalent, by definition, 
to saying that the two preparations are one-shot distinguish- 
able. Mathematically, the measurement and the diaphragms 
are then represented by a POVM satisfying Eqns. (0, or equi- 
valently by the orthogonality of the density matrices repres- 
enting the two preparations. 

Let us now look again at the two statements Q and (0. 
We understand that, by "state", von Neumann meant a density 



matrix (or a Hilbert-space ray, which can be considered as a 
particular case thereof). 
The first statement Q, 

"Two [density matrices] 0, if/ can certainly be 
separated by a semi-permeable diaphragm if they 
are orthogonal", 

seems to be saying just this: the orthogonality of two dens- 
ity matrices (represents (mathematically) the fact that their re- 
spective preparation procedures are one-shot distinguishable, 
i.e., can be separated by some device. The demonstration of 
this statement does not really require thermodynamic argu- 
ments, as we have seen. 17 
The second statement was: 

"If [the density matrices] </>, if/ are not orthogonal, 
then the assumption of such a semi-permeable 
diaphragm [which can separate them] contradicts 
the second law of thermodynamics". 

Let us now try to analyse it. The statement mix together math- 
ematical concepts (density matrices, non-orthogonality) and 
physical ones (semi-permeable diaphragm), so we may try to 
state it either in physical or in mathematical terms only. 

Saying that two density matrices are not orthogonal is the 
mathematical way of saying that their corresponding prepara- 
tion procedures are not one-shot distinguishable. So the state- 
ment becomes: 

If two preparations are not one-shot distinguish- 
able, then the assumption of a semi-permeable 
diaphragm which can separate them contradicts 
the second law. 

But in the previous sections we have shown that assuming the 
existence of a separating semi-permeable diaphragm is equi- 
valent by construction to assuming the existence of a meas- 
urement which can distinguish the preparations in one shot. 
So what the second statement is saying is just the following: 

If two preparations are not one-shot distinguish- 
able, then the assumption that they are one-shot 
distinguishable contradicts the second law. 

So the statement assumes that two preparations are one- 
shot distinguishable and not one-shot distinguishable; or in 
mathematical terms, that their density matrices are orthogonal 
and not orthogonal. It has the logical form '(A A -A) => B', 
and makes thus little sense, because it starts from contradict- 
ory premises. In particular, it cannot have any physical im- 
plications for the second law of thermodynamics. We must 



What von Neumann's demonstration of the first statement really shows is 
something else: viz., that his quantum entropy expression is consistent with 
thermodynamics. This, however, does not concern us in the present paper, 
and will perhaps be analysed elsewhere l93ll . 
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in fact remember that, in a logical formal system, from con- 
tradictory assumptions ('A A -A') one can idly deduce any 
proposition whatever ('fi') as well as its negation ('-1Z?') (see 

e.g. innm). 

The same contradictory premises are also present in Peres' 
demonstration of the second statement by means of a thought- 
experiment. This experiment will be discussed in detail in 
the second part of this paper, for a different purpose. But we 
can anticipate its basic idea only in order to see where the 
contradictory premises lie: 

A container is divided into two chambers containing 
two quantum-ideal-gas samples. These are represented 
by non-orthogonal density matrices. Two semi-permeable 
diaphragms are then used to completely separate the two 
samples, isothermally, just as in the first example of § 11.41 In 
this way some heat is absorbed by the gases, or equivalently, 
some work is done by them. The process is then completed so 
as to come back to the initial situation, and it is easily shown 
that a net amount of heat is absorbed by the gases (or a net 
amount of work done by them), violating the second law. 

The contradictory premises are the following. It is as- 
sumed, on the one hand, that the two initial preparations of 
the quantum-gas samples are not one-shot distinguishable, as 
is reflected by the use of non-orthogonal density matrices; i.e., 
that there does not exist a measurement procedure able to dis- 
tinguish and separate them in one shot. On the other hand, it 
is assumed immediately thereafter that there is such a meas- 
urement procedure, as is reflected by the existence of the sep- 
arating diaphragms which must necessarily implement it; in 
particular, the quantum-gases have then to be represented by 
orthogonal density matrices. This is a contradiction of course, 
and it has nothing to do with thermodynamic, work extraction, 
or heat absorption. The thought-experiment cannot be contin- 
ued if we do not state clearly which of the mutually exclusive 
alternatives, one-shot distinguishability or not-one-shot dis- 
tinguishability, i.e. orthogonality or non-orthogonality, is the 
one meant to hold. 

Note that we are not saying that there are no relationships or 
valid consistency considerations between quantum mechanics 
and thermodynamics. In fact, what many of von Neumann's 
and Peres' analyses and thought-experiments really demon- 
strate is that the quantum-mechanical physical concepts and 
principles are consistent with the thermodynamic ones, which 
is a fundamental result. As Peres |3, p. 275] states it, "It 
thus appears that thermodynamics imposes severe constraints 
on the choice of fundamental axioms for quantum theory". 18 
What we are saying is that the existence of a semi-permeable 
diaphragm that can separate two preparations, and the non- 
one-shot distinguishability of these preparations, are contra- 



dictory assumptions within the quantum-mechanical theory it- 
self, and so cannot be simultaneously used to study the con- 
sistency of this theory with another one (thermodynamics). 
Using an example given in § 11.31 we can discuss the consist- 
ency of a particular microscopic force field with macroscopic 
thermodynamic laws; but it is vain to assume that the force 
field is conservative and its integral along a closed path does 
not vanish in order to derive a violation of the second law 
— these assumptions are contradictory by definition and any- 
thing can be vacuously derived from them. 

Peres' thought-experiment, though, does not lose its value 
because of its contradictory premises. The reason lies in its 
subtle (and nice) presentation: not as an impersonal reason- 
ing, but as an interplay between two observers (us and a "wily 
inventor", as Peres calls him J3, p. 275]). It is thus possible to 
re-interpret the contradictory premises as different, contrast- 
ing pieces of knowledge of two observers. This will be now 
done in the second part of the paper. 



PART II: CONTRASTING PIECES OF KNOWLEDGE 
AND VIOLATIONS OF THE SECOND LAW 



"One can 't perhaps," said Humpty Dumpty; "but two 
can. With proper assistance, you might have left off at 
seven." 1331 



2.1. The dependence of a density matrix on the observer's 
knowledge and on the set of preparations and measurements 
chosen 



The fact that a density matrix always depends on the par- 
ticular knowledge about the properties of the preparation it 
represents, has been stressed amongst others by Kemble |4, 
p. 102110 prx 1155-57], Jaynes |^||JiJl3l, Caves et 
al. O El El El, and ourselves (3 EJEiF 9 Discus- 
sions and analyses of the compatibility of density-matrix as- 
signments by different observers have already been offered, 
e.g., by Brun et al. 11311 . and Caves et al. 11411 . 

Note in particular that a density matrix can be associated to 
a preparation (and a POVM to a measurement) only after we 
have specified the whole sets of preparation and measurement 
procedures we want or have to consider — indeed, it is quite 
appropriate to say that these sets define our 'system'. This 
might appear paradoxical to some, but is clearly reflected in 
the fact that, in the description of a phenomenon, we always 
have to choose a particular Hilbert space before introducin 



any density matrix. This fact can be inferred from Peres 



icing 
' |31 



We thank an anonymous reviewer for the American Journal of Physics for 
pointing out this passage to us. In the same paragraph, Peres also mention 
a petitio principii which, however, concerns the proof of the equivalence of 
the von Neumann and thermodynamic entropies. 



Caves, Fuchs, and Schack see the density matrix as a sort of "quantum" 
analogue of a probability, and develop a "quantum" probability theory in 
analogy with ("classical") probability theory (with analogous theorems, 
like e.g. Bayes' and de Finetti's |nl|llB©mi23II031)- We should 
instead prefer to derive the quantum formalism as a particular application 
of probability theory llTlllal . 
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and Hardy's IT9I1 work, and we have tried to make it mathem- 
atically explicit elsewhere fl^.fl7lfl8il . 

Thus, if we add a single measurement procedure — perhaps 
a newly discovered one — to the chosen set of measurement 
procedures, we must in some cases change (numerically and 
even dimensionally) the density matrices we had previously 
associated to the preparations. 20 



2.2. The value of Peres' thought-experiment from a different 
perspective 

Peres' thought-experiment is particularly suited to show the 
points above. In its presentation, two observers have con- 
trasting pieces of knowledge about the preparations of the 
quantum ideal gases: the first observer thinks them not to 
be one-shot distinguishable, and represents them accordingly 
by non-orthogonal density matrices; the second observer, in- 
stead, knows they are one-shot distinguishable and, in fact, he 
possesses actual devices — the semi-permeable diaphragms 
— by which he separates them. This is reflected in the fact 
that the set of measurement procedures considered by the 
second observer contains (at least) a measurement (viz., the 
one which can distinguish in one shot the gas preparations) not 
contained in the set of the first observer, simply because the 
first observer did not know about the existence of that meas- 
urement. 

A similar, two-observer style of presentation was chosen by 
Jaynes 1 20, § 6] for a thought-experiment which showed that 
two different observers may have different pieces of know- 
ledge and so give different thermodynamic descriptions for 
the same physical situation. When they interact, strange res- 
ults may arise; e.g., it may appear to one of them as if the other 
were violating the second law. From this point of view Jaynes' 
and Peres' thought-experiments are very similar also in their 
results; the similarity is the more fascinating because Jaynes' 
experiment is completely within a classical, not quantum, con- 
text. 

Grad [ 108, p. 325] also remarked that not only two observ- 
ers, but even more simply the same observer facing two dif- 
ferent applications, may use different pieces of knowledge and 
descriptions to study the same physical situation. 



For this reason, we shall analyse Peres' thought-experiment 
with the following scheme. First, we present it from the point 
of view of the first observer, that we call Tatiana, which sees 
a seeming violation of the second law. Then we go over to 
Jaynes' simpler thought-experiment, and see how the two ob- 
servers' (Johann and Marie) contrasting descriptions are re- 
solved there. With the insight provided by Jaynes' experi- 
ment, we turn back to Peres', and re-analyse it from the point 
of view of the second observer, that we call Willard, see- 
ing that he had some additional knowledge and measurement 
means with respect to Tatiana, and that for him the second law 
is not violated; on the other hand, Tatiana will have to change 
the dimensions and numerical values of the density matrices 
used in her description if she wants to make allowance for 
Willard's new measurement procedure. 21 



2.3. Peres' thought-experiment: Tatiana's description 

Peres' demonstration |3, pp. 275-277] can be presented as 
follows. We are in a quantum laboratory, where the physicist 
Tatiana is studying a container wherein two quantum-ideal- 
gas samples are confined to two chambers, having volume 
V/2 each and separated by an impermeable diaphragm. From 
some measurements made on a series of identically prepared 
containers, she has chosen to describe the internal quantum 
degrees of freedom of the ideal gases by means of the density- 
matrix space for a quantum two-level system. In her descrip- 
tion, the upper chamber contains a z + -gas, the lower an x + - 
gas, where z + and x + are the density matrices defined in § 11.41 
Eqns. (0 and dl2l . They are non-orthogonal, \x{z + x + ) + 0, 
because to Tatiana's knowledge there are no means to distin- 
guish the two corresponding preparations in one shot, as she 
could not separate the two gases completely with any semi- 
permeable diaphragms known to her. 

Enters a "wily inventor"; let us call him Willard. He claims 
having produced two such semi-permeable diaphragms, 
which can completely separate the two gases. 22 In fact, by 
means of them he reversibly mixes the two gases, obtaining 
work equal to Q = NkT\n2 * 0.693 NkT (cf. Eqn. flj)). 
From Tatiana's point of view this is quite surprising! 

Now she has a single container of volume V filled with a 
half/half mixture of z + - and jc + -gases (Fig. [3] b). She wants 



This fact is related to the restrictions ("compatibility" or "positivity do- 
mains") of the set of statistic al matrices for the "redu ced dynamics" of 
some open quantum systems 1 10 il l 102. 103, 104, lOSl llOfUlOTll . These 
restrictions simply arise because one wants to describe a set of preparation 
procedures by means of a density-matrix space that is instead only meant 
for, and is only appropriate for, a particular smaller set. On the other hand, 
the reason why one wants to do this is that quantum mechanics has the fol- 
lowing unfortunate property: if you want to add just one more preparation 
or measurement procedure to the sets that interest you, you must be ready 
to "pay" for at least 2N + 1 more dimensions for your matrix-spaces, where 
N 2 - 1 is the dimension of the (normalised-Hermitean-matrix) spaces you 
were using (i.e., you pay the difference between (N + l) 2 - 1 and N 2 - 1). 
Compare this with the classical "cost" of only 1 more dimension. 



In some situations it might perhaps be necessary to abandon the quantum- 
theoretical physical principles partially or altogether, in favour of other 
more economical or aesthetically pleasing. Note that it is easily proven 18] 
(cf. lid. § V], Holevo 11091 chap. I]) that there always exist appropriate (if 
necessary, infinite-dimensional) quantum-mechanical density-matrix- and 
POVM-spaces by which one can represent sets of preparation and measure- 
ment procedures having any statistical properties whatever. In this sense, 
the quantum-theoretical mathematical formalism can never be "proven 
wrong" (but note that the same also holds for the classical statistical form- 
alism 1 109, § 1.7]), although it may be redundant (see preceding footnote). 
22 This being a scientific paper, we ideally (i.e., unrealistically) assume Wil- 
lard's honesty and the absence of any fraud. 
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Figure 3: Quantum gas experiment from Tatiana's point of 
view. 



to return to the initial situation, but she cannot use Willard's 
puzzling means, of course. For her the situation is now the 
same as that discussed in the second example of § 11.41 the 
gas mixture is for her equivalent (Fig. |5J d) to a mixture of 
approximately 0.854 parts of an ar + -gas and 0.146 parts of an 
aT-gas, where a + and a~ are the density matrices defined in 
Eqn. \\A\ . Tatiana uses two semi-permeable diaphragms to 
separate the two a^-gases, the cr + -gas into a 0.854 fraction of 
the volume V, and the a -gas into the remaining 0.146 frac- 
tion (so that they have the same pressure), and spends work 
equal to -Q" = -MT(0.854 In 0.854 + 0.146 lnO.146) ~ 
0.416 NkT, cf.Eqn.fl8). 

Tatiana then performs two operations corresponding to 
unitary rotations which reversibly change the density matrices 
associated to the two gases into the same density matrix, say 
z + , so that the two chambers eventually contain for her the 
same z + -gas. She then eliminates the diaphragms and reinserts 
another impermeable one in the middle to divide the gas into 
two chambers of equal volume (Fig. [3] e) — which is, from 
her point of view, a reversible operation — , and finally per- 
forms again an operation represented by a rotation z + h> x + 
of the density matrix associated to the gas in the lower cham- 
ber. In this way she has apparently re-established the original 
condition of the gases (Fig. [3] a), which have thus undergone 
a cycle. The last operations were assumed to be performable 
without expenditure or gain of work, hence without heat ex- 
change either. 23 

Tatiana summarises the results as follows: a cycle has been 



Von Neumann j]J pp. 194, 197] and Peres p. 275] assert that unitary 
rotations can be realised without heat exchange, but work exchange is al- 
lowed and indeed sometimes necessary. However, in our present discus- 
sion we have assumed all processes to be isothermal and all gases ideal, 
which implies that any reversible isochoric work exchange (like the unit- 
ary rotations) would be accompanied by an equivalent reversible heat ex- 
change (see § 11.41 . with a consequent undesired entropy change. This is 
why Tatiana's final isochoric unitary rotations must be performed with no 
energy exchange. This issue is related to the problematic way in which 
the quantum and classical or thermodynamic descriptions are combined; 
namely, the density matrices are not thermodynamic variables au pair with 
the real numbers (V and T) describing the gas; cf. § 11.41 



completed because the initial and final situations are the same. 
The total heat Q absorbed by the gases equals the work — 
experimentally measured — done by them and amounts to 

Q = Q' + Q" ~ (0.693 - 0.416) NkT = 0.277 NkT > 0. (19) 

Hence, she sees a violation of the second law Q because, 
from her point of view, 

Q > in the cyclic process. (20) 

Tatiana accuses Willard of having violated the second law 
by means of his strange semi-permeable diaphragms that 
"completely separate non-orthogonal density matrices". 

Has the second law really been violated? Also from Wil- 
lard's point of view? We do not answer these questions now; 
instead, we leave the quantum laboratory where Tatiana and 
Willard are now arguing after their experiment, and enter an 
adjacent classical laboratory, where we shall look at Jaynes' 
demonstration 12011 . The situation there is in many respects 
very similar to the previous one, though it is completely "clas- 
sical". 

2.4. Jaynes' thought-experiment 

In the classical laboratory, we have a container wherein 
an ideal-gas sample is equally divided into two chambers 
of volumes V/2 each and separated by an impermeable dia- 
phragm (Fig. |3 a). From the measurements made by the sci- 
entist Johann, the gas in the two chambers is exactly the same, 
"ideal argon". 24 For Johann it would thus be impossible, not 
to say meaningless, to find a semi-permeable diaphragm that 
be transparent to the gas in the upper chamber and opaque 
to the gas in the lower one, and another diaphragm with the 
opposite properties. 

The scientist Marie, also in the laboratory, states never- 
theless that she has in fact two diaphragms with those very 
properties. She uses them to reversibly and isothermally mix 
the two halves of the gas, obtaining work equal to Q - 
NkT In 2 * 0.693 NkT (Fig. g] b), and leaving Johann stu- 
pefied. 

In fact, from Johann's point of view Marie has left things 
exactly as they were: he just needs to reinsert the impermeable 
diaphragm in the middle of the container and for him the situ- 
ation is exactly the same as in the beginning: the one gas is 
equally divided into two equal chambers (Fig.0] c a). 

Johann's conclusion is the following: The initial and final 
conditions of the gas are the same, so a cyclic process has been 
completed. The work obtained — experimentally measured 
— equals the heat absorbed by the gas, 

Q = Q' ~ 0.693 NkT > 0, (21) 



Real argon, of course, behaves like an ideal gas only in certain ranges of 
temperature and volume. 
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Figure 4: Classical gas experiment from Johann's point of 
view. 



and so for Johann 

Q > in the cyclic process (22) 

(cf. Tatiana's Eqn. MOD , which plainly contradicts the second 
law of thermodynamics 0. 

We see that what happens here is completely analogous to 
what has happened in the quantum laboratory: a physicist has, 
from initial measurements, a particular description of a given 
situation. Then a process takes place that contradicts the phys- 
icist's mathematical description, and the re-establishment of 
what was thought to be the initial situation yields an apparent 
experimental violation of the second law. 

The reader has no doubt noticed that the facts were presen- 
ted not only from Johann's point of view, but also, so to speak, 
taking side with him. In fact, we plainly ignored Marie's 
experimental performance in our mathematical description. 
But it should be clear that, since the two gas samples in the 
chambers behaved differently with respect to Marie's semi- 
permeable diaphragms (one sample exerted pressure on one 
of the diaphragms but not on the other, and vice versa for the 
other sample), then they must actually be samples of two dif- 
ferent gases, 25 contrary to what Johann (and we) believed and 
described mathematically. 

Note that this fact does not completely contradicts Johann's 
point of view. It simply means that the two samples behave 
exactly in the same manner with respect to Johann's exper- 
imental and measurement means, and so he was justified to 
consider them as samples of the same gas. But now Johann 
has experimental evidence that the two samples behave dif- 
ferently in certain circumstances, and so, in order to avoid 
inconsistencies, they have to be considered as samples of dif- 
ferent gases, at least in all experimental situations in which 
their difference in behaviour comes about (such as the mixing 
performed by Marie). 

So let us see how the whole process has taken place from 
Marie's point of view. She explains that the gas samples ini- 
tially contained in the two chambers were two different kinds 
of ideal-argon, of which Johann had no knowledge: 'argon a' 
("Ar) and 'argon V (''Ar). Argon a is soluble in whafnium 
while argon b is not, but the latter is soluble in whifnium, 



Provided, of course, that this phenomenon is also reproducible. 
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Figure 5: Classical gas experiment from Marie's point of 
view. 



a property not shared by the a variety. Marie's separation 
of the two gases "Ar and h Ai was possible by means of two 
semi-permeable diaphragms made of whifnium and whafnium 
that take advantage of these different properties. (Of course, 
Jaynes' and our speaking of 'argon a , 'argon V , 'whafnium', 
and 'whifnium' in this imaginary experiment is just a coloured 
way of stating that the two samples are of different gases. The 
reader can, if he or she so prefers, simply call them 'gas A' 
and 'gas B' and take into account their different behaviour 
with respect to the two semi-permeable diaphragms.) 

According to Marie (Fig. the second law is not viol- 
ated: Initially the two gases °Ar and fc Ar were completely 
separated in the container's two chambers (Fig. |5j a). After 
her mixing and extracting work, the container contained an 
equal mixture of °Ar and 6 Ar (Fig. |5J b). Upon Johann's 
reinsertion of the impermeable diaphragm the container was 
again divided in two equal chambers, but each chamber con- 
tained a half/half mixture of °Ar and ^Ar (Fig. [5] c), and 
this was different from the initial condition (Pig. EI a )- Thus 
the cycle was not completed, although it appeared so to Jo- 
hann, and so the form of the second law for cyclic processes, 
Eqn. 0, cannot be applied. To close the cycle one has to 
use the semi-permeable diaphragms again to relegate the two 
gases to two separate chambers, and must thereby spend an 
amount of work -Q" at least equal to that previously obtained, 
-Q" > Q', an d the second law for the completed cycle is 
satisfied: Q = Q' + Q" < 0. 

The simple conclusion, stated in terms of entropy, that 
Jaynes |20, § 3] draws from this demonstration, is that 

it is necessary to decide at the outset of a prob- 
lem which macroscopic variables or degrees of 
freedom we shall measure and/or control; and 
within the context of the thermodynamic sys- 
tem thus defined, entropy will be some func- 
tion S (Xi , . . ., X„) of whatever variables we have 
chosen. We can expect this to obey the second 
law [Q/T < AS] only as long as all experi- 
mental manipulations are confined to that chosen 
set. If someone, unknown to us, were to vary 
a macro variable X n+ \ outside that set, he could 



Jaynes 20, § 5] explains that 'whifnium', as well as 'whafnium', "is one 
of the rare superkalic elements; in fact, it is so rare that it has not yet been 
discovered". 
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produce what would appear to us as a violation 
of the second law, since our entropy function 
S (X] , . . . , X„) might decrease spontaneously [i.e., 
without absorption of heat (Q = 0)], while his 
S(Xi, . . . , X„,X„ + i) increases. 



This is old wisdom: Grad fifll p. 325] (see also IllcUlllIn 
explained thirty-one years earlier that 

the adoption of a new entropy is forced by the 
discovery of new information. [. . . ] The exist- 
ence of diffusion between oxygen and nitrogen 
somewhere in a wind tunnel will usually be of 
no interest. Therefore the aerodynamicist uses an 
entropy which does not recognise the separate ex- 
istence of the two elements but only that of "air". 
In other circumstances, the possibility of diffu- 
sion between elements with a much smaller mass 
ratio (e.g., 238/235) may be considered quite rel- 
evant. 

But Jaynes' and Grad's remarks do not apply only to en- 
tropy; they have greater generality. We always choose some 
variables — with particular ranges, scales, and governing 
equations — to describe a physical phenomenon. And such 
a choice always represents, and is dependent on, the particu- 
lar knowledge that we have about that phenomenon, or that we 
think is sufficient to describe it in a given situation or applic- 
ation. 27 This is true in particular for the quantum-mechanical 
density matrices, Hilbert spaces, and so on. 



2.5. Peres' thought-experiment: Willard's description 

With the insight provided by the analysis of the classical 
experiment and by Grad's and Jaynes' remarks, we can return 
to the quantum laboratory and look with different eyes at what 
happened there. 

Just as in the case of Johann and Marie, we must admit that 
in the presentation of the quantum experiment we took not 
only Tatiana's point of view, but also her parts, disregarding 
Willard's experimental evidence in our mathematical descrip- 
tion. In fact, Willard showed that the two quantum ideal gases 
can be completely (and, it is assumed, reproducibly) separ- 
ated; and as shown in § 11.51 this means that there must ex- 
ist a measurement procedure by which the two corresponding 
quantum preparations can be distinguished in one shot. This 
is not in complete contradiction with Tatiana's initial descrip- 
tion: with the measurement means at her disposal, the two 
preparation procedures were not distinguishable in one shot, 



27 For an excellent discussion on different levels and scales of description in 
the particular case of thermodynamics, see Wood ITTll : also Samohyl I2I 
especially § 7] and Jaynes [ 113, § 1.2]. 



and so for her they were appropriately represented by non-or- 
thogonal density matrices. But, in order to avoid inconsisten- 
cies, the two preparation procedures have to be represented by 
orthogonal density matrices in all experimental situations in- 
volving the new measurement capability — such as Willard's 
mixing process. 

Willard thus represents the two quantum ideal gases by or- 
thogonal density matrices, and his mathematical analysis of 
the thermodynamic process is different from Tatiana's. Let us 
follow, step by step, a possible explanation of the quantum-gas 
process from his point of view. He explains that the internal 
quantum degrees of freedom of the gases are best represented 
by the density-matrix space for a quantum four-level system 
(it might be that the molecules of the gas where diatomic and 
not mono-atomic as Tatiana believed), of which Tatiana only 
used a subspace because of her limited measurement means 
(e.g., she probed the internal quantum degrees of freedom of 
only one of the molecule's atoms) . In other words, part of 
the density-matrix space used by Willard was "traced out" in 
Tatiana's description, because she had only access to measure- 
ment procedures represented by a portion of the total POVM 
space. 

Denoting by {|z + z + ), \z~ z + ), \z + Z~), \z~ z~)} the orthonormal 
basis for the Hilbert space used by Willard, Tatiana could not 
distinguish, amongst others, the preparations corresponding 
to \z + z + ) and to |z + z~), both of which she represented as |z + ), 
nor those corresponding to \x + z + > — (|z + z + > + |z~ z + »/ V2 and 
to |x + z~) — (|z + z~) + |z~Z - ))/ V2, which she represented as 
Tatiana's projection is thus of the kind \<f> ip) h-> \<j>): 



|z + z + W|z + >, |z-z + W|z->, 
|z + z~W|z + >, \z~z~)^\z~), 



(23) 



from which also follows 



|x + z + >^|x + ), |jTz + > ^ \x~), 
\x + z~) i-> \x + ), \x~ z~) \x~). 



(24) 



From Willard's point of view, the process went as follows. 
He had also made some measurements on identically prepared 
containers, and according to his results, the container's two 
chambers initially contained z + - and Jc + -gases (Fig. [6] a), with 



l + "\z + z + ){z + z + 
x + ^ \x + z -){x + z- 



1000 
0000 
0000 
0000 

0000 
0000 
0010 
0000 



(25) 
(26) 



These density matrices are orthogonal, ti(z + x + ) = 0, be- 
cause his measurement means allow him to distinguish the 
two corresponding preparations in one shot. This is precisely 
what Tatiana could not do, instead, and so she represented 
the two preparations by the non-orthogonal density matrices 
z + = |z+Xz + | and*+ = |0<z-|. 

Willard mixed the two separable gases with his semi- 
permeable diaphragms, obtaining work (Fig. |6] b), so that the 
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(a) 



(b) 



(c) 



|z+z+Xz + z+l 


e' > o 


0.5 |z + z + Xz + z + |+ 
0.5 U + z -><.r + z-| 




0.43 |ff + z + ><a + z + |+ 
0.07Kz + )<ff"z + |+ 
0.43 |ar + z-><a + n+ 
0.07 lor" z~)(a~ Z'\ 


\x* z-)(x* -,-\ 






(0 * (a) 


(-C" > e' + g") 


(e) 


(d) 


-e" < a 


0.5 |z + z + Xz + z + |+ 
0.5 \z + z-)(Z + Z-\ 




0.5 |z + z + Xz + z + l+ 
0.5|z + z-Xz + z"l 




0.5 |u + z + X« + z + l+ 
0.5 \a*z~){a*z'\ 


0.5 \x*z*)(x* z + |+ 
0.5 |x + z-><.v + z-| 




0.5 |z + z + Xz + z + l+ 
0.5 \z+z-)(z+z-\ 










1 i — * 

0.5 z + |+ 
0.5 [o-rXofz - ! 



Figure 6: Quantum gas experiment from Willard's point of 
view. 



container eventually contained a r-gas, where 



1-4 1.4 

-z + -* 

2 2 



/ 1 o o o 
1 1 o o o o 

2 1 
\ 



(27) 



Tatiana's subsequent separation by means of her semi- 
permeable diaphragms (Fig. |S] c d) which can separate the 
preparations corresponding to a + and a~, is represented by 
Willard by the two-element POVM {E + , E~], with 



|o*z + ><o* z + \ + \a t z-)(ae t z-\ 

(2±^j2 ±V2 



- I 

4 



^ 

±V2 2=fV5 
2±V5 ±V5 
±V2 2=fV2^ 



(28) 



where 



la* z + > d = r (2 ± V2)~ 2 (|z* z + ) ± |x* z + », 
10*2-) « ( 2 + V^flzV) ± IrV}) 



(29) 
(30) 



(cf. Eqn. dl4t ). and the associated CPMs (projections) r h-> 
E ± tE ± /Xt(E ± tE ± ). 

That separation led to a chamber, with volume 0.854 V, 
containing the gas mixture 



l -\a + z + ){a + z + \+ l -\a + Z -){a + Z - 



(31) 



and another chamber, with volume 0. 146 V, containing the gas 
mixture 



1 - 4 -4 1 

-\a z ){a z | + -\a z ){a z 



(32) 



Note that Tatiana could not notice that these were mixtures, 
because of her limited instrumentation. Moreover, from Wil- 
lard's point of view the separation brought about a transform- 
ation of the gases' quantum degrees of freedom. 

The following step corresponded to the unitary rotations 



\a + z + )(a + z + \ h> \z + z + )(z + z + 
\a + Z -)(a + zl i-> \z + z~)(z + z- 



(33) 
(34) 



for gases in the upper chamber, and 

\a- Z + )(a-z + \ h> |z + z + ><z + z + |, 
\a~ z~){a~ z~\ h> |z + z~)<z + z~| 



(35) 
(36) 



for the gases in the lower chamber. The successive elimin- 
ation and reinsertion of the impermeable diaphragm yielded 
two chambers of equal volume V/2 and same content, viz. the 

mixture of gases described by |z + z + )(z + z + | and |z + z~)(z + z~| 
(Fig. El e). 

Tatiana's final rotation for the gas in the lower chamber, 



|z + z + ) ' * |x + z + ), 



|z + z~> \x + z~), (37) 



only led to two equal chambers containing the mixtures 



of 



5 U 

^Z" 



')(z + z + \ 



-)(z + z~\ 



and 



z + ){x + 



s|a <. X* + z~| gases respectively (Fig.|6j f). 

From her point of view, i.e., from what her measuring 
means could tell, this final situation was identical with the 
initial one (Fig. [6] a), i.e. a z + -gas in one chamber and an x + - 
gas in the other, and so she thought the thermodynamic cycle 
completed. But we see now that the final and initial situations 
were in fact different. Hence the second law was not violated, 
because the cycle was not completed, and the form for the 
second law does not apply. 

It is also easy to see that in order to return to the initial 
condition an amount of work -Q'" Mx ( 1 /4)NkT In 2 as 
0.693 NkT has to be spent to separate the |z + z + )(z + z + |- 
gas from the |z + z~)(z + Z~| -gas, and analogously for the 
\x + z + )(x + z + |- and \x + z~)(x + z~|-gases. A final operation must 
then be performed corresponding to the rotations of the dens- 
ity matrices |z + z~)<z + z~| and \x + z + )(x + z + | to |z + z + )<z + z + | 
and \x + z~){x + z~\ respectively, and we have finally reached 
again the initial condition (Fig. [6] a). The total amount of 
heat absorbed by the gases in this completing process would 
then be 



Q = Q' + Q" + Q'" < (0.693 - 0.416 - 0.693)NkT = 

- 0Al6NkT < 0, (38) 

and the second law, for the completed cycle, would be satisfied 
(strictly so: we see that the whole process is irreversible, and 
it is easy to check that the only irreversible step was Tatiana's 
transformation and separation into a + - and a^-gases). 

The original conclusion has thus been reversed: no viola- 
tion of the second law is found. 



3. CONCLUSIONS 

Preparation procedure and density matrix, with their re- 
spective properties, are quite different concepts, but intimately 
related because the latter is a mathematical representation of 
the former. In particular, the orthogonality of two density 
matrices mathematically represents the fact that their corres- 
ponding preparation procedures can be distinguished in one 
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shot. The one-shot distinguishability of two preparation pro- 
cedures is also equivalent to the possibility of separating them 
by appropriate semi-permeable diaphragms. This equivalence 
is important because it relates their statistical and gross ther- 
modynamic properties. 

A consequence of this physical-mathematical relation is 
that an observer cannot assume the non-orthogonality of two 
density matrices and, at the same time, claim the one-shot dis- 
tinguishability of their corresponding preparation procedures, 
because this assumption is self-contradictory and can only 
lead to vain conclusions. We have seen this kind of contra- 
dictory assumption behind the second statement (0. 

However, two observers can represent the same prepara- 
tions by means of density matrices having different numerical 
values and properties (e.g., orthogonality), and even different 
dimensionality. This may happen either because the two ob- 
servers have different pieces of knowledge about the prepara- 
tions' properties and the measurements available; or because 
each observer, having a different purpose or application, con- 
siders different sets of preparations and measurements to de- 
scribe the same physical phenomenon. In particular, proper- 
ties like one-shot distinguishability — and thus also orthogon- 
ality — always depend on the particular set of measurement 
procedures which an observer decides to consider (sometimes 
the whole known set at his or her disposal). 

We have seen an example of this point in Peres' thought- 
experiment: An observer does not know of any measurement 
procedure that can distinguish in one shot two particular pre- 
parations; accordingly, she represents them by non-orthogonal 
density matrices. A second observer then shows the existence 
of such a measurement procedure, and so both observers must 
then use orthogonal density matrices to represent the prepar- 
ations, at least when they want to operate with that measure- 
ment. 

In any case, the consistency between the physical phenom- 
ena considered and their mathematical description is always 
essential. 
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Appendix: ORTHOGONALITY PROOF 

Suppose that two preparation procedures are represented by 
density matrices <f> and \)/, and that we have a measurement 
procedure represented by a POVM {E^, F v ] such that 

tr(0^) = and tr(^) t for all E^, 
tr(^F v ) = and tv(<f>F v ) + for all F v . 

These equations mathematically represent the condition of 
one-shot distinguishability, as discussed in § 11.21 

We want to prove that from the above equations it follows 
that the two density matrices are orthogonal: 

tr(0(^) = 0. (A.2) 

To prove this, let us consider the two-element POVM {E, F} 
given by: 

E^J]E„ F^YjFy. (A3) 

H v 

This POVM represents a "coarse graining" of the original 
measurement procedure, with some results grouped together. 
It is easy to see that it satisfies 

ti{d>E) = and XxtyE) =1^0, 

(A.4) 

tr(i//F) = and ti(<f>F) =1^0. 

The first POVM element E can be written as a sum of one- 
dimensional orthogonal projectors {|/)(/|} (here and in the fol- 
lowing the indices i, k, j, and / take values in appropriate and 
possibly distinct ranges): 

E = ei\t)(i\, with < e, < 1 for all i (A.5) 

i 

(note that the sum is not necessarily a convex combination, 
even though its coefficients are positive and not greater than 
unity). 

We can decompose the first density matrix in an analog- 
ous manner: 

(p = V fa\k){%\, with < fa < 1 for all k and ^ fa = 1, 

k k 

where the projectors are not necessarily parallel or or- 

thogonal to the {|/)(/|}. 

The assumption \i(<f>E) = can be written as 

tr(0E) = J] fae\(k | i)\ 2 = 0, (A.7) 

k,i 

which by the strict positivity of the fa and e,- implies 

(k\i)=0, for all k and i, (A. 8) 

i.e., the projectors — equivalently, the eigenvectors — of the 
matrix E are in fact all orthogonal to those of the matrix 0. 
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This means that the can be completed by the 

and some additional projectors {|/)(/|} to form a complete set 
of orthonormal projectors — an orthonormal basis: 

{\m\k)<h\f)(f\}ikj- (A.9) 
Writing the identity matrix / in terms of the new projectors, 

1 = 2 | ^ |+ Z M ' + Z | "'" >(j " 1 ' (A - 10) 

k i 

we have that the second POVM element F must be given by 
F = / - E = " ed\i){i\ + Yj\k)(k\ + J^l/Kfl (ATI) 

i k j 

Let us now decompose the second density matrix \j/ as 

i]/ = ^ 'A/IWI, with < (A/ < 1 for all / and ^ <A/ = 1 . 

(A.12) 

where the projectors {|/)(/|} do not necessarily belong to the 
complete set previously introduced. 

Using Eqns. (IA. 12i and (IA.1H . we rewrite the assumption 
that tr(ifrF) = as 

WF) = J]Wl-ei)\<t\i')\ 2 + 

l,i 



(A.13) 



U 



Noting that all the sum terms are non-negative, and that the 
coefficients iff/ are strictly positive, we deduce in particular 
that 



(l\k) = 0, for all I and k, 



(A. 14) 



which means that the eigenvectors of the density matrix <f> are 
all orthogonal to those of the matrix 
We have thus 

tr(W) = Y J <PkMl\~k)\ Z = 0, (A. 15) 



k,l 



Q.E.D. 



The converse is also easy to demonstrate. Consider again 



the definitions JA.6t and ( IA. 12i . and assume that these density 
matrices are orthogonal, i.e., Eqn. ( IA. 151 . It follows that their 
eigenvectors are orthogonal, Eqn. JA.14L and may be used to- 
gether with additional projectors {|/)(/|} to form a complete 
orthonormal set 

{\b(h\i}(l\f)(f\}kjj. (a. i6) 

Now define the operators 

E^^jikW, (A. 17) 

k 

F^I~E = + + (A. 18) 



(they are not those used in the previous proof, although we use 
the same symbols for convenience). These operators form, as 
is easily checked, a POVM {E, F], which satisfy Eqns. \AAl . 
If we moreover assume that any POVM may be physically 
realised by some measurement procedure, 28 then we have 
proven the converse statement: if two preparations are rep- 
resented by orthogonal density matrices, it means that they 
are one-shot distinguishable. q.e.d. 
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